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Abstract 

The variable domains of antibodies and T-Cell receptors (TCRs) share similar structures. Both molecules act as sensors for the 
immune system but recognise their respective antigens in different ways. Antibodies bind to a diverse set of antigenic 
shapes whilst TCRs only recognise linear peptides presented by a major histocompatibility complex (MHC). The antigen 
specificity and affinity of both receptors is determined primarily by the sequence and structure of their complementarity 
determining regions (CDRs). In antibodies the binding site is also known to be affected by the relative orientation of the 
variable domains, VH and VL. Here, the corresponding property for TCRs, the vp-Va orientation, is investigated and 
compared with that of antibodies. We find that TCR and antibody orientations are distinct. General antibody orientations are 
found to be incompatible with binding to the MHC in a canonical TCR-like mode. Finally, factors that cause the orientation 
of TCRs and antibodies to be different are investigated. Packing of the long Va CDR3 in the domain-domain interface is 
found to be influential. In antibodies, a similar packing affect can be achieved using a bulky residue at IMGT position 50 on 
the VH domain. Along with IMGT VH 50, other positions are identified that may help to promote a TCR-like orientation in 
antibodies. These positions should provide useful considerations in the engineering of therapeutic TCR-like antibodies. 
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Introduction 

The immunoglobulin fold provides the scaffold for many 
different proteins with diverse a set of functions [1]. The 
immunoglobulin domain consists of two |3 sheets arranged in a 
sandwich motif. Many protein structures consist of multiple 
immunoglobulin-like domains. These domain types are particu- 
larly prevalent in the immune system of vertebrates. 

A key task of the immune system is to specifically recognise 
molecules that are potentially pathogenic or foreign to the organism. 
Examples of components that enable this are B-cell receptors or, in 
their soluble form, antibodies. These are able to bind to antigens 
without the aid of other cellular machinery. The portion of the 
molecule that mediates antigen binding, the variable fragment (Fv), 
consists of two immunoglobulin domains, VH and VL. 

In contrast, another component of the immune system, the T- 
cell receptor (TCR), binds only to peptide antigens and only when 
they are presented on the surface of a cell by the major 
histocompatibility complex (MHC). However, like the antibody, 
the TCR binds using its variable region that consists of two 
domains, Va and Vfi, which are analogous to the antibody VL 
and VH domains. 

Each of the variable domain types have three CDRs that are 
generally characterised as loop structures [2-4]. It is these six 



CDRs that are the primary determinants of antigen specificity and 
affinity. A fourth hyper-variable region is also found on the V|3 
domain [5] . However, this is not thought to interact directly with 
the epitope. Each variable domain type is expressed from a 
different locus on the genome and is generated from a 
combination of genes. This allows large repertoires of potential 
receptor sequences to be generated. The antibody Vk and VA, 
(collectively VL) and the TCR Voc domain are generated from two 
gene types named variable (v) and joining (j). The antibody VH 
and TCR V(3 domains are built from v and j genes along with an 
additional gene type, the diverse (d) gene. The additional 
variability that the d gene brings is reflected in the sequences 
and structures of the third complementarity determining region 
(CDR) loop in VH and V(3 domains [6]. 

Recently, the way in which the antibody variable domains are 
orientated has been recognised as a determinant of antigen 
binding [7] . The variable domain orientation affects how the CDR 
loops are positioned relative to one another and therefore 
influences the geometry of the antigen binding site [8,9]. Antibody 
humanization studies have found that mutations to framework 
residues distant from the binding site can act to regain the affinity 
of the original murine antibody [10]. This effect has been 
attributed to structural changes in the Fv of the antibody. One 
effect may be to change the relative orientation between the 
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Author Summary 

The immune system needs to be able to sense molecules 
that might be harmful to the organism. Such harmful 
molecules are known as antigens. Two classes of receptor 
proteins that mediate antigen recognition are antibodies 
and T-Cell receptors (TCRs). Antibodies are able to bind a 
diverse range of antigen shapes whilst TCRs are specialised 
to recognise a cell-surface protein, the pMHC. Antibodies 
that bind the pMHC are rarely created naturally. However, 
such TCR-like antibodies are of therapeutic importance. 
The binding regions of the TCR and the antibody have very 
similar three dimensional structures. Both consist of two 
independent units, domains, which associate and form the 
antigen binding site between them. This work examines 
how the two domains orientate with respect to one 
another in TCRs and antibodies. Our results show that the 
conformations that exist in TCRs and antibodies are 
distinct. Consequently it is difficult for an antibody to 
bind to a pMHC in the same way a TCR would. However, a 
similar conformation can be achieved in antibodies as in 
TCRs by the presence of certain amino-acids in the domain 
interface. This knowledge should aid the development of 
therapeutic TCR-like antibodies. 

domains [8,11,12]. In addition to its biological importance, 
predicting the VH-VL orientation is also considered one of the 
challenges in modelling antibody structures [7,13]. 

We and others have studied the geometry of the variable 
domain orientation in antibodies [14-17]. Distinct sets of 
orientations can be identified in antibodies and attributed to 
sequence differences on the domain-domain interface. One 
example of a functional difference between antibodies with distinct 
domain orientations is related to the size of the antigen they bind 
to [15]. Antigen size is also related to the level of variable domain 
orientation conservation found in sequence-identical structures 
[16]. Whilst antibodies can bind to a highly diverse set of antigenic 
shapes and sequences, TCR epitopes are relatively similar to one 
another. Therefore, although similar in function, the binding sites 
of the two molecule types are under different pressures. Antibodies 
must be versatile in binding to many different targets whilst the 
TCR antigen binding site can optimise to associate specifically 
with the shape of MHCs. 

Although the diversity in potential T-Cell epitopes is much 
smaller than for antibodies, variation is present in the MHC class, 
allele, and the bound peptide [18]. MHC class I molecules 
(MHCI) are present on the surface of most cells. The binding 
groove consists of a single chain and is closed at both ends. This 
enables MHCI to present peptides of typically nine residues in 
length. MHCI peptides are of intracellular origin. The MHCI is 
attached by one trans-membrane region to the cell surface. In 
contrast, MHC class II molecules (MHCII) form their binding 
groove with two chains. The binding groove of MHCII is open at 
both ends allowing for peptides of lengths of 20 or more residues to 
be presented [19], MHCII peptides are derived from extracellular 
proteins. The MHCII is attached with two transmembrane regions 
to the cell surface. Even within the same MHC class, sequence 
differences allow for a specific binding repertoire of peptides and 
TCRs [20]. Currently 8,124 human MHCI and 2,409 human 
MHCII alleles are known [21]. Despite these differences MHCs 
share the same overall fold with the peptide presented above an 
anti-parallel beta sheet floor and flanked by two kinked alpha 
helices. This allows the TCR binding site to be highly structurally 
conserved. 



The relative similarity between T-Cell epitopes is also reflected 
in the structure of the complex formed between the TCR and 
MHC/peptide. Typically, the TCR is found to bind diagonally 
with respect to the MHCs peptide groove. This geometry of 
interaction is referred to as the canonical binding mode. Those 
cases where the TCR is positioned in a different orientation are 
known as non-canonical [22]. Although no strict definition for 
canonical association has been made, a number of TCRs that bind 
unusually have been found to be related to autoimmune responses 
[23,24]. Ensuring a canonical binding mode may therefore be 
important for preventing the recognition of self- antigens. 

Naturally, T-Cell and B-Cell receptors perform different 
functions in the immune system. An example of a non-natural 
interface between the two receptor types are TCR-like antibodies. 
These antibodies are specific for T-cell epitopes. Given that they 
bind to a component of the organism, they are not normally found 
naturally in the body. However, in diseases where T-cell 
functionality is inhibited, engineered TCR-like antibodies provide 
a method for delivering cytotoxic drugs and induction of infected 
cell apoptosis [25,26]. An advantage of using a TCR-like antibody 
as a therapeutic over a soluble recombinant TCR [27] is their 
higher natural affinity to an antigen [28]. Subsequendy, a lower 
level of affinity enhancement is required using techniques such as 
phage display. In addition, the serum half-life of an antibody is 
measured in days to weeks, whilst that of a TCR is of the order of 
hours [29]. 

Antibodies able to recognise T-cell epitopes must mimic a TCR 
and replicate their binding site properties to enable pMHC 
specificity. Producing TCR-like antibodies is a difficult procedure 
inhibited by the process of isolating the desired MHC/peptide 
epitope [26]. However, advances in phage-display technology 
have allowed the generation of such molecules for diagnostic and 
potential therapeutic purposes [30-32]. Successfully generated 
TCR-like antibodies share some similar pMHC-binding properties 
with TCRs that have the same specificity [32,33]. For example, 
mutations to certain positions on the MHC disrupt both TCR and 
antibody binding [32]. However, previous comparisons between 
individual antibodies and TCRs have also reported that the 
receptors use different features to achieve pMHC specificity 
[32,34,35]. This suggests that antibodies do not necessarily mimic 
TCR canonical binding. The therapeutic importance of TCR-like 
antibodies motivates the investigation of how they may be 
engineered to exhibit improved MHC specificity. Whilst the 
TCR's CDR residues primarily mediate MHC recognition, the 
orientation between the TCR Va and V|3 domains has previously 
been proposed to contribute in determining T-Cell epitope 
specificity [36]. 

In this study we investigate the Vp-Vot domain orientations in 
TCR variable regions and compare them to the analogous 
structural space in antibodies. Functional reasons for differences in 
the two receptor types are proposed by analysing the influence of 
domain orientation on the structural properties of a TCR-MHC 
complex. Given that the orientation space is different we 
investigate which sequence positions may be contributing factors 
and suggest residues on the domain interface of antibodies that 
may induce a more TCR-like orientation. 

Materials and Methods 

Dataset 

All X-Ray crystal structures containing a paired TCR ot and f> 
chain were extracted from the protein data bank (PDB) [3 7] using 
the international ImMunoGeneTics information system (IMGT) 
[38] . IMGT numbering was retained for each of the TCR variable 
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domains. Those 92 structures with a resolution of less than 3 A 
formed the full redundant dataset. This set contained 49 TCRs 
bound to MHCI, 15 bound to MHCII and 28 unbound. A 
sequence identity filter of 90% was applied over the TCR variable 
domains using cd-hit [39] to form a non-redundant set of 39 
structures. Nineteen were bound to MHCI, 1 1 to an MHCII and 
9 were unbound. 

A non-redundant antibody dataset was created using the 
structural antibody database (SAbDab) [40]. Again, a sequence 
identity filter of 90% was applied to the full sequence of the 
variable domains using cd-hit. The non-redundant antibody set 
consisted of 441 structures. The IMGT DomainGapAlign tool 
[41] was used to apply IMGT numbering to the variable domains 
in each of the structures. 

The structures in SAbDab were filtered to identify TCR-like 
antibodies bound to MHCs. A BLAST search [42] for the 
sequence of the MHCI and MHCII was performed using a 
database created from the antigens in SAbDab. Manual inspection 
of these top hits and a keyword search for phrases similar to "TCR 
like antibody" identified a total of 4 structures. One of the 
structures (3hae) is higher affinity mutant of another (3gjf) from the 
same experiment [33]. Therefore 3 cases were identified for TCR- 
like antibodies in complex with an MHC. 

Rationale for domain equivalence 

To compare the variable domain orientations of antibodies and 
TCRs it is necessary to define which domains are equivalent in the 
two receptor types. Is VH equivalent to VP or Va? 

Comparing the mean sequence identity between the domain 
types showed that Vk and VA. (collectively VL) domains were 
more similar to both Va and VP than VH was to either. 
Therefore, sequence identity alone provides no clear indication. 

To compare the use of the VH-Va/VL-VP or the VH-VpVVL- 
Voc equivalence we examined the residue conservation at 
individual IMGT positions in the sets of domains. To check the 
use of VL/Va and VH/VP equivalence we identified IMGT 
positions where the VH and Va domains have the same conserved 
residue whilst in VL domains a different conserved amino-acid is 
present. Positions such as these suggest that the alternative 
equivalence should be used. These positions were identified for 
both equivalences i.e. VH-Vp/VL-Va and VH-Va/VL-Vp. A 
position was deemed to be conserved if it had the same amino-acid 
in 50% or more sequences. Five positions support making VH- 
Voc/VL-VP equivalent whilst 10 positions support making VH- 
Vp/VL-Va equivalent. This equivalence is also supported by the 
similarity in the combination of genes that make up corresponding 
domains. Both VP and VH are generated from v, d and j genes 
whilst Va and VL are built from just v and j genes. We therefore 
use the VH-Vp/VL-Va equivalence herein. 

Variable domain orientation root mean square deviation 

The variable domain orientation root mean square deviation 
(RMSD) is a measure of the difference in orientation between two 
structures. As an example, the orientation RMSD between two 
TCR structures (Tx and Ty) is calculated as follows. Tx and Ty 
are structurally aligned using the Cot coordinates of their shared 
Vol domain framework positions. Tx's VP domain is then 
independendy aligned to Ty's VP domain. The RMSD is 
calculated between Tx's VP domain in its native orientation and 
the transformed position. The same procedure is also performed 
using the VP domains for the initial alignment and calculating 
RMSD between Va domains. The mean of these two values 
describes the difference in the relative orientation. All structural 



alignments were performed using the Biopython SVD super- 
imposer [43]. 

The orientation RMSD was calculated for each pair of 
structures in the non-redundant antibody set, the non-redundant 
TCR set and between the sets. These measurements were used as 
distances between the structures allowing them to be clustered 
using a complete-linkage hierarchal clustering method [44]. 

Applying the ABangle methodology to TCRs 

Whilst relative measures of orientation differences such as the 
orientation RMSD give a magnitude to changes in pose they do 
not reveal how structures vary. The ABangle methodology was 
developed to characterise the orientation between the VH and VL 
domains in antibodies. It gives six absolute measures, five angles 
and a distance, to fully describe the orientation between the two 
domains. The same method can be applied to any set of 
structurally defined homologous domain pairs. The process is 
presented in detail in Dunbar et al 2013 [16] and can be 
summarised as: 

1 . Define the most structurally conserved positions for both 
domains (coresets). 

2. Fit frames of reference through interface positions using 
coordinates from multiple structural alignment. 

3. Define consensus (mean) structures for both domains. 

4. Compute or choose the pivot axis about which to measure 
orientation. 

5. Define six measures about the resulting coordinate system to 
describe orientation. 

This process was followed for the TCR structures in our dataset. 
The coreset positions are listed in the supplementary material. To 
allow for direct comparisons to be made with antibodies, the pivot 
axis, C, was chosen to be the same as in the antibody ABangle 
work. However, the pivot axis was also calculated for TCRs and 
has been used in a separate study [45] to investigate variation in 
structural properties of the TCR-MHC/peptide complex using 
molecular dynamics simulations. Whilst the two axes are similar, 
the TCR-derived one lies closer to the centre of the domain- 
domain interface than the antibody derived one. 

The resulting coordinate system allows for 6 absolute measures 
of orientation to be defined, five angles and a distance. The 
distance, dc, is the length of C. HL describes a torsion angle of one 
domain with respect to the other and is similar to the packing 
angle defined by Abhinandan et al [14]. HC1 and LCI are angles 
that describe the tilt of one domain towards the other. HC2 and 
LC2 describe twisting-like differences between the orientations of 
variable domain structures. In each of these cases the ABangle 
antibody nomenclature has been retained. 

The MHC-TCR docking angle 

TCRs are often observed to bind to the MHC in a diagonal 
mode with respect to the peptide binding groove. Qualitative 
observations have labelled this association geometry as the 
canonical binding mode. The binding geometry can also be 
described quantitatively using the MHC-TCR docking angle [46]. 
This is defined as the angle between the major axis of the peptide 
and the vector between the Ca atoms of the interface cysteines 
(IMGT position 104) on the Va and VP domains. We calculated 
the docking angle for both TCRs and antibodies bound to an 
MHC. A canonical angle for TCRs is defined as being between 
40° and 85°, a range similar to that found by previous studies 
[22,47]. 
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Measuring the effect of variable domain orientation in a 
TCR-MHC complex 

The effect that changing the VP-Va orientation has on the 
structure of a TCR-MHC complex was investigated. A single 
TCR-MHC-peptide complex was chosen (PDB lmi5 [48]) as the 
native structure. The MHC is of class I and presents a peptide 9 
residues in length from the Epstein Barr Virus. Both the MHC and 
TCR are human in origin and the structure is solved at a 
resolution of 2.5 A. The docking angle of the TCR-MHC is 51.4° 
and is therefore considered to be in the canonical range. A set of 
20 non-redundant structures were chosen from across the antibody 
orientation space (Table SI in Text SI). Similarly, a set of 20 
structures was sampled from the non-redundant TCR dataset 
(Table S2 in Text SI). These two sets are referred to as the 
antibody and TCR decoy sets respectively. 

The native complex was compared to structures made when the 
VP-Vot orientation was changed to assume the poses in the TCR 
and antibody decoy sets. The following protocol was used to 
change the VP-Va orientation. 

Given a decoy structure, the VL domain (or Va domain for 
TCR decoys) was aligned to the Va domain of the native complex 
(Figure la). This structural alignment was performed using the 
shared framework positions of the two domains. The native VP 
domain was then transformed independently to assume the 
resulting pose of the decoy VH domain (or VP domain for TCR 
decoys). Thus, the Vot domain is in a native position relative to the 
MHC-peptide complex whilst the VP domain inherits the 
orientation of the decoy (Figure lb,c). 

After remodelling the residue side chains with Scwrl4 [49], 
clashes were identified between the VP domain in its decoy 
position and the MHC /peptide chains using the program 
Molprobity [50]. Any pair of residues with a CP — CP atomic 
distance of less than 7 A were also recorded. In addition, the 
DOPE score [5 1] of the entire complex was calculated to assess the 
energetic change in modifying the domain orientation. The 
process was then repeated in an analogous way to change the Va 



position to assume the decoy orientation whilst maintaining a 
native VP pose. 

Results 

vp-Va orientations are different from VH-VL orientations 

Antibody and TCR structures were clustered based on their 
orientation RMSD (Figure 2). The TCR and antibody structures 
fall into separate clusters with very little mixing of the two types. 
This demonstrates a difference in the variable domain orientations 
of TCRs and antibodies. The orientations of the TCRs alone fall 
into two distinct clusters. One might suspect that this may be 
related to MHC type or species. However, neither cluster has a 
strong bias for either property. Furthermore, no clear general 
characteristic can be identified to separate TCRs with these 
different conformations. In general, TCR binding orientation 
appears to show little relationship to VP-Va orientations, however 
the scarcity of non-canonical binding examples makes it hard to 
draw any specific conclusions. 

Whilst the RMSD measure shows that the orientations of 
antibodies and TCRs have distinct conformations it does not show 
us how the structures differ. We therefore used the ABangle 
methodology to identify structural differences. 

ABangle measures reveal how antigen receptors differ 

ABangle's six orientation measures were calculated for each 
structure in the dataset. Figure 3 shows the distributions for both 
antibodies and TCRs. A similar magnitude of variation is observed 
in the orientations of the two sets of structures. However, as with 
the RMSD measure of orientation, a clear difference in 
orientation preference is seen between the two structure sets. This 
is best characterised in the HC2 angle where the distributions are 
significandy different (Kolmogorov-Smirnov test p-value 
2.2x10 *). In fact, antibodies are almost never observed to 
reach the extreme HC2 orientation seen in TCRs. Two of the 
other orientation angles, LCI and LC2 are also significandy 




Native orientation 



TCR orientation 



Antibody orientation 



Figure 1. Changing the Vp-Va orientation of a TCR/MHC-peptide complex, a) The native complex 1mi5. No residue clashes are observed 
between the TCR and the MHC-peptide. b) The complex with the TCR placed in the V(3-Vo( orientation of another TCR structure. A single clash (black 
line) is identified between the V(3 domain and the MHC. c) The complex with the TCR placed in the VH-VL orientation of an antibody. Multiple residue 
clashes, shown by black lines, are found between the V(5 domain and the MHC. 
doi:10.1371/journal.pcbi.1003852.g001 



PLOS Computational Biology | www.ploscompbiol.org 



4 



September 2014 | Volume 10 | Issue 9 | e1 003852 



Variable Domain Orientations in Antigen Receptors 



o 



03 



< 

Q 

if) 

(Z 

c 
o 

'■4—1 

-1—1 
!Z 

cu 



CD 



CM 



o J 




TCR 



Antibody 



Figure 2. Clustering of the antibody and TCR structures by their relative orientation RMSD. The two types of structure fall largely into 
different clusters. 

doi:10.1371/journal.pcbi.1003852.g002 



different (p-values 7.1x10" and 8.6x10" respectively) whilst 
the two receptor types do not have significandy different HL 
torsion angles or HC1 bend angles. The distributions of the dc 
length are significantly different with a p-value of 3.46 x 10 s . The 
difference in orientation between antibodies and TCRs can be best 
described as a twisting-like change of the variable domains with 
respect to one another. Figure 4 demonstrates such a structural 
difference in orientation between the antigen receptor types. 

Antibody orientations are incompatible with binding in a 
TCR-like mode 

For such similar molecules in terms of their function and 
domain structure, it is somewhat surprising that TCRs and 
antibodies have such different orientations. As discussed previous- 
ly, TCRs bind specifically to peptide antigens only when they are 
presented by MHCs on the surface of a cell. In comparison, 
antibodies are not restricted in the same way and bind to a far 
more varied set of antigenic shapes. 

To investigate whether the variable domain orientation is likely 
to affect function we calculated the influence that changing the 
VP-Vot orientation had on the structure of a single TCR-MHC 
complex PDB lmi5). Two sets of orientation decoys were used, 
one from antibody structures and the other from other TCR 
structures (for full details see the materials and methods section). 

For both of the decoy sets, the number of Vot-MHC /peptide 
residue clashes and number of VP-MHC/peptide residue clashes 
were counted using Molprobity. No clashes were observed 
between either domain and the MHC /peptide chains in the 
native complex. For the TCR decoy set medians of 3.5 and 5 
clashes were found for the Vo( and VP domains respectively (4 and 



8.5 before side chain rearrangement). In comparison, medians of 5 
and 8 clashes were induced for the equivalent domains using the 
antibody decoys (10 and 14 before side chain rearrangement). A 
Mann- Whitney U test found the increase in the total number of 
clashes to be statistically significant (p-value 0.005). The increased 
number of clashes between the proteins suggests that antibody 
orientations are incompatible with binding in a canonical TCR- 
like mode. For the native complex and the re-orientated complex 
sets, the mean CP-CP contact distances were calculated. The 
contact matrices for each set are shown in Supplementary Figure 
S5 in Text SI. 

The difference between the antibody-orientated complexes and 
the TCR-orientated complexes can also be measured using the 
energetics of the structures. Supplementary Figure SI in Text SI 
shows the distributions of the DOPE score relative to the native 
state. Making the assumption that the native state approximates an 
energy minimum, we find that both sets of complexes are 
transformed to less favourable, higher energy states represented 
by higher DOPE scores. However, the set of complexes with 
orientations assumed from other TCRs are found to be 
significandy closer to the native state score than those with 
antibody orientations. 

These results suggest that binding in a TCR-like mode requires 
the receptor to have a TCR-like variable domain orientation. 
Although the VP-Vot orientation is at least as variable as the VH- 
VL orientation, TCR conformations are generally compatible 
with binding to the MHC in a canonical mode. In contrast, the 
space of antibody orientations is such that the canonical mode 
becomes more difficult to obtain. Despite this observation, 
antibodies may be engineered such that they bind to MHC- 
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Figure 3. The ABangle orientation measures for antibodies and TCRs. The receptor types are found to have different orientations. The 
difference is best characterised by the HC2 angle. This corresponds to a twist of the VH or V(3 domain with respect to the VL or Va domain 
respectively. 

doi:10.1371/journal.pcbi.1003852.g003 



peptide epitopes. These molecules are known as TCR-like 
antibodies. 

Orientations of TCR-like antibodies 

The orientations of each of the TCR-like antibodies found in 
SAbDab were calculated using the ABangle methodology. 
Supplementary Figure S2 in Text S 1 shows where each of these 
lie compared to the background antibody distribution. There 
appears to be no preference for their variable domain orientation. 
However, one structure, PDB 3cvh, has an orientation that is 
considered extreme for an antibody. It has an HC2 angle that 
leans towards the domain orientations of TCRs. 

To determine whether these TCR-like antibodies behave 
similarly to TCRs we investigated the geometry of the complex 
they form with the MHC using the docking angle (see materials 
and methods). Most of the antibody complexes do not bind in a 
canonical manner (Table 1). Instead, they bind diagonally but 
appear to use residues in the VL domain in the same way as a 
TCR uses the VP residues. Likewise the VH domains sit in a 
similar position to Vol domains in the canonical TCR-MHC 
complexes. 

Only one of the TCR-like antibodies, 3cvh, binds in a similar 
mode to the TCR structures and with the expected equivalence. 
As discussed above, 3cvh, is the only antibody that had an variable 



domain orientation that approached that seen in the TCR space. 
Although only a single example, this structure suggests that in 
order to be able bind in a TCR-like mode to the MHC, a TCR- 
like variable domain orientation should be promoted. 

Factors for promoting a TCR-like VH-VL orientation 

Given that variable domain orientation may be related to the 
functions of antibodies and TCRs, we investigated which factors 
give rise to the structural differences between them. The type of 
antigen an antibody binds does not determine its absolute VH-VL 
orientation [16] (Figure S6 in Text SI). Consequently, peptide- 
binding antibodies are no more similar in orientation to TCRs 
than general antibodies. The largest sequence variation in both 
receptor types occurs in the CDRs. Insertions also occur in these 
regions and cause variation in CDR loop structures length. The 
CDR3 in each domain type is likely to be the most influential loop 
as it makes the most inter-chain contacts. 

Supplementary Figures S3a and S3b in Text SI show the length 
distributions of CDR3 in VL/Vot domains and VH/VP domains 
respectively. The CDR3s in both VH and VP have a wide range of 
lengths. Their length distributions are similar and are both centred 
on a length of 12 residues. The CDR3s of VL and Vot have very 
different length distributions. The antibody loop (VL CDR3) is 
almost always 9 residues long whilst CDR3 in the TCR domain 
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Figure 4. Two views showing the difference in variable domain orientation between antibodies and TCRs. The VH domain (red) and VL 
domain (green) of the antibody structure 3hzk is shown in its native orientation. The same antibody is also shown in the orientation assumed from 
the TCR structure 3qiu. In this case the VH domain is shown in cyan. The difference in variable domain orientation can be described as a twist of one 
domain with respect to the other and is best characterised using our HC2 twist angle. 
doi:10.1371/journal.pcbi.1003852.g004 



(Vol) is longer with a modal length of 13 residues. The VL/Va 
CDR3 loop is partially packed in the domain-domain interface. 
The fact that Va CDR3 is generally longer than VL CDR3 can be 
related to the observed difference in the HC2 twist angle (Figure 
S4 in Text SI) between TCRs and antibodies. The location of the 
loop at the interface means that in order to compensate for a 
larger number of residues TCRs tend to reduce the HC2 angle as 
the interface twists open. 

To test the effect of the VL CDR3 length on orientation in 
antibodies we identified all non-redundant structures in SAbDab 
with CDRL3 loops of 13 residues. Nine such antibodies exist 
(Table S3 in Text SI). Eight of them have orientations that are 



typical of antibodies. This would suggest that having a long 
CDRL3 loop alone does not promote a TCR-like orientation. One 
antibody, 3B5H10 Fab, (PDB structures 3s96 and 4dcq [52]) has 
an orientation similar to that of the TCRs. However, four of the 
other antibodies in this set have the same light variable germline 
subgroup (mouse IGLV3), are at least 95% identical sequences, 
and have identical L3 loops to 3B5H10. They are paired with 
different heavy chains suggesting that it is VH interface residues 
that give rise to the difference in orientation. 

To isolate these positions, we compared the VH framework 
residues involved in contacts in the five IGLV3 structures (Table 
S4 in Text SI). Fifteen VH positions are involved in contacts in all 



Table 1 


The docking angle for each TCR-like 


antibody/MHC complex. 






PDB 


Heavy Chain 


Light Chain 


Docking Angle 


3cvh 


H 


L 


48.9° 


3cvh 


Q 


R 


49.3° 


3gjf 


H 


L 


122.3° 


3gjf 


M 


K 


125.1° 


3hae 


H 


L 


124.7° 


3hae 


I 


G 


124.9° 


3hae 


0 


N 


126.2° 


3hae 


T 


S 


123.6° 


1w72 


H 


L 


139.6° 


1w72 


I 


M 


140.3° 


doi:1 0.1 371/journal.pcbi.l 003852.t001 
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the structures. In one position, IMGT 50, the antibody with the 
TCR-like orientation is a phenylalanine whilst the other four have 
leucine at the same position. This large aromatic residue makes 
contact with the CDRL3 loop and is packed in the periphery of 
the interface. There are nine other structures in SAbDab with 
non-identical sequences that have phenylalanine at the same 
position. In each case, the residue has been mutated from the 
germline leucine. Seven of these have small HC2 angles (<115°) 
similar to that observed in TCRs. This suggests that making a 
mutation from leucine to phenylalanine at IMGT position H50 
pushes the VH-VL orientation towards a more TCR-like 
conformation. 

IMGT position 50 in V(3 domains is also predominantly leucine. 
In TCRs the orientation therefore appears to be achieved using 
other interface positions. To examine how a VH-VL interface 
could be made more TCR-like, we examined the relative amino- 
acid frequencies at interface positions. Framework positions were 
identified that are conserved in both TCRs and in antibodies (> 
50%) but with a different amino acid in each set. Ten positions 
were identified and are listed in Table 2. In order for a position to 
provide a feasible target for engineering it is preferable for a 
potential mutation to have been observed naturally. We therefore 
consulted the Abysis database [53] to find those positions where 
the conserved TCR amino-acid is also observed at the same 
position in 2 % or more of non-identical antibody sequences. We 
find five such positions and refer to them as antibody compatible 
mutations. 

Each of the antibody compatible mutations is located on the 
periphery of the VH-VL interface. Positions from a similar region 
of the domain interface were previously found to be influential for 
variation in the HC2 measure of VH-VL orientation [16]. 
Examining the sequences of the TCR-like antibodies finds that 
only the antibody that binds in the TCR canonical binding mode 
and has the most TCR-like variable domain orientation (3cvh) has 
the conserved TCR amino acid at any of these five positions 
(glutamine at H/ P 6). When the ABangle measures for antibodies 
with the TCR amino-acid at these positions are calculated we find 
that two of the four positions, H6 and H4, select for structures with 
small HC2 angles indicative of the TCR-like orientations. 

In summary, the packing of the CDR3 of Va and VL domains 
is likely to be an influential factor for domain orientation. 
However, having a long antibody CDRL3 does not necessarily 
replicate the TCR orientation. Instead, the VH-VL orientation 
can be made more TCR-like by adopting specific amino acids at 
certain interface positions: phe-H50, gln-H6 and val-H4. 



Discussion 

The orientation between the antibody variable domains, VH 
and VL, has previously been found to be influential in determining 
the geometry of the antigen binding site [14—16]. Here, we have 
compared the VH-VL orientation to the analogous property in the 
T-Cell receptor, the VP-Va orientation. The two receptor types 
have distinct sets of orientations. Using the ABangle methodology 
[16], we characterised how the orientation differs. The best 
descriptor of orientation difference is the change in the HC2 bend 
angle. This corresponds to a twisting-like change of the VH or VP 
domain with respect to the VL and Va domains respectively. 

The functional implications of variable domain orientation were 
investigated by analysing its effect on the structure of a TCR- 
MHC complex. The VP-Vot orientation was changed to assume 
the orientation of a set of antibody structures and a set of other 
TCR structures. The contacts, clashes and energetics of the 
transformed complex structures were analysed. Antibody orienta- 
tions are found to be incompatible with binding to the MHC in a 
canonical manner. In contrast, assuming orientations of other 
TCRs disrupted the complex structure far less. The TCR and 
MHC are thought to have co-evolved over millions of years 
allowing for their interaction to be optimised [54]. Variable 
domain orientation may be one example of a structural property 
that the TCR has evolved to enhance its ability to recognise the 
MHC. Alternatively, such properties may not be encoded on the 
genome but instead selected for during thymic education of T-cells 
[55]. Therefore, wider ranges of VP-Voc orientations may be 
possible but only particular conformations are chosen as they are 
structurally compatible with MHC and co-receptor interactions 
[56]. 

Despite the apparent steric restriction imposed by their variable 
domain orientations, it is possible to engineer antibodies that bind 
to the MHC. The small number of structures of TCR-like 
antibodies bound to an MHC were examined and their docking 
angle calculated. Only one of the three cases, 3cvh, binds in the 
canonical TCR binding mode. This antibody has a VH-VL 
orientation that is similar to the TCRs. The other TCR-like 
antibodies also bind in a diagonal mode but use their VH domains 
in the same way as a TCR uses its Voc domain. Reversing the 
equivalence of the domains allows them to overcome the steric 
restrictions imposed by the variable domain orientation. The 
current number of available structures is small so it is not possible 
to make statistically robust conclusions. As more structures become 
available how TCR-like antibodies bind the MHC might be better 



Table 2. Interface positions that are conserved in both TCRs and antibodies but with a different amino acid. 



Position 


Antibody Residue (%) 


TCR Residue (%) 


Antibody Compatible Mutation? 


H/p 103 


Y (80) 


F(62) 


Yes 


H/p 100 


A (93) 


S (56) 


No 


H/p 120 


Q (76) 


P (51) 


Yes 


H/p 52 


W (95) 


L (72) 


No 


H/p 72 


K(78) 


P (69) 


No 


H/p 118 


W (90) 


F (97) 


No 


H/p 6 


E(53) 


Q(97) 


Yes 


H/p 4 


L (98) 


V (79) 


Yes 


H/p 42 


V (75) 


Y(97) 


No 


L/a 4 


M (55) 


V(72) 


Yes 


doi:10.1371/jou 


rnal.pcbi.1003852.t002 
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understood. To understand how an antibody may be engineered 
to bind specifically to the MHC in a canonical manner, we 
examined the factors that cause antibodies and TCRs to be 
distinct in the HG2 angle. 

TCRs tend to have longer Va CDR3 loops than antibodies 
have VL CDR3 loops. The Va CDR3 packs into the domain- 
domain interface and acts to open up the VP-Va orientation. 
However, antibody structures with similar longer VL CDR3 loops 
are generally not observed to share the TCR orientation. The 
length of the loop alone is therefore not predictive of the variable 
domain orientation. However, a TCR-like orientation is found in 
antibodies with particular interface residues. The most influential 
position we identified was at IMGT position 50 on the VH 
domain. Here, antibodies with a phenylalanine instead of the 
germline leucine tend to have a TCR-like orientation. The TCR 
itself predominantly has a leucine at this position. Thus in TCRs 
the interface is twisted open due to packing a longer Va CDR3 in 
the interface but in antibodies the same effect is achieved by the 
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